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Abstract 

As newly synthesized glycoproteins move through the secretory pathway, the asparagine-linked glycan (N-glycan) 
undergoes extensive modifications involving the sequential removal and addition of sugar residues. These modifications are 
critical for the proper assembly, quality control and transport of glycoproteins during biosynthesis. The importance of N- 
glycosylation is illustrated by a growing list of diseases that result from defects in the biosynthesis and processing of N- 
linked glycans. The major rhodopsin in Drosophila melanogaster photoreceptors, Rhl, is highly unique among glycoproteins, 
as the N-glycan appears to be completely removed during Rhl biosynthesis and maturation. However, much of the 
deglycosylation pathway for Rhl remains unknown. To elucidate the key steps in Rhl deglycosylation in vivo, we 
characterized mutant alleles of four Drosopliila glycosyl hydrolases, namely ot-mannosidase-ll (a-Man-ll), a-mannosidase-llb 
(a-Man-llb), a p-W-acetylglucosaminidase called fused lobes (Fdl), and hexosaminidase 1 (Hexol). We have demonstrated 
that these four enzymes play essential and unique roles in a highly coordinated pathway for oligosaccharide trimming 
during Rhl biosynthesis. Our results reveal that a-Man-ll and a-IVlan-llb are not isozymes like their mammalian counterparts, 
but rather function at distinct stages in Rhl maturation. Also of significance, our results indicate that Hexol has a 
biosynthetic role in N-glycan processing during Rhl maturation. This is unexpected given that in humans, the 
hexosaminidases are typically lysosomal enzymes involved in N-glycan catabolism with no known roles in protein 
biosynthesis. Here, we present a genetic dissection of glycoprotein processing in Drosopliila and unveil key steps in N- 
glycan trimming during Rhl biosynthesis. Taken together, our results provide fundamental advances towards 
understanding the complex and highly regulated pathway of N-glycosylation in vivo and reveal novel insights into the 
functions of glycosyl hydrolases in the secretory pathway. 
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Introduction 

Many proteins that are synthesized in the endoplasmic 
reticulum (ER) and transported through the secretory pathway 
are posttranslationally modified with an asparagine (N)-linked 
carbohydrate moiety, termed an N-glycan (Figure 1). N-glycosyl- 
ation is key to the proper assembly, quality control, and transport 
of glycoproteins during biosynthesis [1,2]. Not only does the N- 
glycan direcdy reduce protein aggregation, but it also serves as a 
signal for interaction with key lectins and glycosyl hydrolases. N- 
glycosylation begins in the ER, where preformed GlcjMangGlc- 
NAc2 (Figure 1) is transferred, en bloc, to asparagine (Asn) residues 
on nascent polypeptide chains containing the consensus sequence, 
NXS/T [3,4]. As the newly synthesized glycoprotein moves 
through the secretory pathway, the oligosaccharide precursor 
undergoes extensive modifications involving sequential removal 
and addition of sugar residues in the Golgi [5] . AH of the enzymes 
that function in carbohydrate trimming during biosynthesis are 



exoglycosidases and thus require their substrate residue to be 
exposed at a terminal position on the N-glycan. Accordingly, 
carbohydrate trimming proceeds sequentially, with the most 
terminal residues being removed first. These modifications serve 
critical functions in glycoprotein recognition, folding, targeting, 
and degradation [1,6]. 

N-glycosylation in Drosophila melanogaster proceeds in a similar 
fashion to the well-characterized glycosylation pathways identified 
in mammalian systems, beginning with addition of the 
Glc3Man9GlcNAc2 precursor to newly synthesized proteins in 
the ER [2,7-9]. Likewise, steps in the subsequent processing and 
trimming of the N-glycan are highly conserved between humans 
and Drosophila [2]. Here, we show that there is at least one 
Drosophila homolog for nearly all of the human glycosyl hydrolase 
(GH) enzymes involved in N-glycosylation (Figures 2 and S2— 
S7). 

One notable diflFerence between vertebrate and invertebrate 
glycan processing is reflected in the relative abundance of the final 



PLOS Genetics | www.plosgenetics.org 



1 



May 2014 | Volunne 10 | Issue 5 | e1004349 



Glycosyl Hydrolases Key for N-Glycan Trimming during Rhi Biosynthesis 



Author Summary 

As newly synthesized glycoproteins move through the 
secretory pathway, the asparagine-linked glycan (N-glycan) 
undergoes extensive modifications involving the sequen- 
tial removal and addition of sugar residues. These 
modifications are critical for the proper assembly, quality 
control and transport of glycoproteins during biosynthesis. 
The importance of N-glycosylation is illustrated by a 
growing list of diseases that result from defects in the 
biosynthesis and processing of N-llnked glycans. The major 
rhodopsin in the Drosophila (fruit fly) eye, Rhl, is highly 
unique among glycoproteins, as the N-glycan appears to 
be completely removed during Rhl biosynthesis and 
maturation. However, much of the deglycosylation path- 
way for Rhl remains unknown. To elucidate the key steps 
in Rhl deglycosylation, we conducted a genetic dissection 
of glycoprotein processing in vivo. We have demonstrated 
that four glycosyl hydrolases play essential and unique 
roles in a highly coordinated pathway for N-glycan 
trimming during Rhl biosynthesis. Our results reveal novel 
insights into the functions of glycosyl hydrolases in the 
secretory pathway and provide fundamental advances 
towards understanding N-glycosylation in vivo. 



N-glycan structures produced on glycoproteins [2] . In vertebrates, 
glycoproteins typically harbor hybrid- or complex-type glycan 
structures, in which one or two additional N-acetylglucosamine 
(GlcNAc) residues serve as the building blocks for the elongation 
and elaboration of the N-glycan. In contrast, glycoproteins in 
Drosophila and other invertebrates typically have high- or pauci- 
mannosidic glycan structures, in which the GlcNAc residues are 
removed prior to elongation, and thus elaboration of the N-glycan 
begins directly from the trimannosyl core. These differences are 
explained by the presence of a P-jV-acetylglucosaminidase in 
Drosophila, called fused lobes (Fdl), which functions to remove the 
terminal GlcNAc residue. Accordingly, mutations in Drosophila 
fused lobes lead to the formation of the hybrid- and complex-type 
glycans that are more typical of vertebrates [10]. 

Mutations in other Drosophila glycosyl hydrolases that function in 
the N-glycan processing pathway have also been described in vivo. 
The first a-mannosidase to be cloned and characterized in 
Drosophila was a-mannosidase-I (a-Man-I, G42275) [7]. NuU 
mutations in cc-man-I (also known as masl) were shown to cause 
developmental defects in the embryonic peripheral nervous system 
(PNS), in the wing, and in the adult eye [7] . It was recently shown 
that a-mannosidase-II (a-Man-II, CGI 8802) and ot-mannosidase- 
Ilb (ot-Man-IIb, CG4606) are required for rhodopsin deglycosyla- 
tion in Drosophila [11]. The authors further demonstrated that a- 
Man-II activity is strictly regulated by a phosphatase, Drosophila 
metaUophosphoesterase (dMPPE). In addition to mutant analysis, 
Drosophila ot-Man-II has been the subject of extensive structural 
analysis over the last decade. Its crystal structure now serves as the 
standard model for all Class II a-mannosidases [12-22]. While 
some of the GH enzymes involved in N-glycosylation have been 
isolated and well characterized in Drosophila, many remain 
unannotated and/ or uncharacterized. 

The importance of N-glycosylation is illustrated by a growing 
list of diseases that result from defects in the biosynthesis and 
processing of N-linked glycans. Congenital disorders of glycosyl- 
ation (CDG) are a group of autosomal recessive diseases that result 
from defects in the synthesis, attachment, and/or processing of 
glycans [23]. Patients with CDG manifest a variety of symptoms 
and many of these diseases affect the eye. Patients with CDG Type 



la and Ic may develop progressive vision loss, retinitis pigmentosa 
(RP), impaired night vision, convergent strabismus, abnormal eye 
movements, as well as other ocular pathologies [24—26]. Several 
additional diseases have been associated with defects in N-glycan 
processing, such as a-mannosidosis and fi-mannosidosis, which are 
autosomal recessive lysosomal storage diseases caused by genetic 
disruption of an a-mannosidase (MAN2B1) or fi-mannosidase 
(MANBl) enzyme, respectively [27,28] (Figure 2). These disorders 
are characterized by the accumulation of undegraded N-glycan 
products, which retain either ot- or fi-linked mannose residues, 
respectively. Patients often suffer from mental retardation, among 
a variety of other clinical manifestations [27,29]. Other symptoms 
include late-onset retinal dystrophy, peripheral neuropathy, 
hearing loss, speech impairment, epileptic encephalopathy, 
angiokeratome, and skeletal/facial dysmorphism [30,31]. Finally, 
two loci encoding the hexosaminidase enzymes, HEXA and 
HEXB (Figure 2), have been direcdy linked to Tay-Sachs and 
SandhoflF disease [32]. 

There is an overwhelming diversity of oligosaccharide struc- 
tures that can be generated during glycoprotein processing, thus 
determining the roles of distinct oligosaccharide structures in vivo 
is a formidable challenge. Limited clues have come from studies 
of cell lines that are defective in certain processing enzymes [33]. 
However, these studies present some limitations, as cell culture 
experiments do not always mimic the complex environments that 
exist in vivo, in multi-cellular organisms. Other insights have come 
from studies involving the treatment of cells or animals with 
inhibitors that block various enzymatic steps in N-glycan 
processing [34]. The caveat with these studies is that the 
inhibitors often target multiple enzymes, complicating interpre- 
tation of the results [35]. One of the most powerful techniques for 
elucidating the specific roles of individual processing enzymes in 
animals involves the generation and analysis of the corresponding 
mutants in vivo. Such a systematic genetic dissection of N- 
glycosylation is entirely feasible in Drosophila. Additionally, the eye 
is not required for viability and therefore null mutations in the 
eye can be easily investigated to uncover the roles of glycosyl 
hydrolases in vivo. 

In this study, we identify four glycosyl hydrolases in Drosophila 
that are critical for N-glycan processing during protein biosyn- 
thesis, namely a-mannosidase-II, a-mannosidase-IIb, a P-jV- 
acetylglucosaminidase called fused lobes, and hexosaminidase 1. 
We have used the major visual protein in Drosophila, rhodopsin 1 
(Rhl), as a substrate for the identification and characterization of 
mutants that are defective in specific N-glycan processing 
enzymes in the eye. Rhl is highly unique among glycoproteins 
as we, and others, have shown that the N-glycan is extensively 
trimmed as Rhl matures through the secretory pathway during 
biosynthesis [36-39]. Here, we demonstrate that Rhl deglyco- 
sylation requires the sequential actions of a defined set of enzymes 
from numerous glycosyl hydrolase families, unveiling a highly 
coordinated pathway for Rh 1 maturation. Furthermore, we show 
that in order to trim the fmal mannose residues (M3, M4, and 
M5) and the final N-acetylglucosamine residues (Nl and N2) at 
the base of the N-glycan (Figure 1), Rhl requires the actions of 
several lysosomal and plasma membrane enzymes that, to date, 
have not been characterized within the context of N-glycan 
processing in the secretory pathway. Therefore, our analysis of 
Rhl deglycosylation has uncovered novel roles for these glycosyl 
hydrolase enzymes during protein biosynthesis. Our findings on 
the glycosyl hydrolases in Drosophila provide insights for future 
studies aimed at unraveling the complex process of N-glycosyl- 
ation in vivo, as well as for understanding mechanisms that 
underlie human disease. 



PLOS Genetics | www.plosgenetics.org 



2 



IVlay2014 | Volume 10 | Issue 5 | e1004349 



Glycosyl Hydrolases Key for N-Glycan Trimming during Rhi Biosynthesis 



Glucosidase I 



Glucosidase II 




Golgi Man lAi 
Golgi Man IB 
Golgi Man IC, 



Golgi Man lA 
Golgi Man IB 
Golgi Man IC 



Major lysosomal 
a-Man 



Minor lysosomal 
a-Man 



Lysosomal 
p-Man 


> 






Lysosomal 
P-Hexo 




> 



N2 



P1,2 



EndoH 



N1 



A = Glucose (Glc, G) 

O = Mannose (Man, M) 

D = N-acetyl glucosamine (GlcNAc, N) 



PNGase F 



Asn 



Figure 1. A schematic of the asparagine (Asn, N)-linked oligosaccharide precursor, Glc3Man9GlcNAc2. The N-glycan contains three 
glucose residues (triangles, G), nine mannose residues (circles, IV13-1 1), and two p-N-acetylglucosamine residues (squares, N1-2). A, B, and C refer to 
the major branches on the N-glycan structure. Colors complement the enzyme list provided in Figure 2, indicating the glycosyl hydrolases (GH) 
responsible for trimming each residue. The a1,2-linked mannose residues (IVI6, M9, IV110, and Ml 1) are removed by Class I a-mannosidases (GH Family 
47). The 0(1,3- and a1,6-linked mannose residues (IV14, IV15, M7, and M8) are removed by Class II a-mannosidases (GH Family 38). M3 is attached to the 
N-glycan by a pi,4-linkage and is removed by a p-mannosidase (GH Family 2). Finally, the terminal GlcNAc residue is attached via a p1,2-linkage and is 
removed by a p-hexosaminidase (GH Family 20). IVIan (mannosidase), Hexo (hexosaminidase). 
doi:1 0.1 371 /journal.pgen.1 004349.g001 



Results 

Identification of mutant alleles affecting N-glycosylation 
of Rhi 

Using heat pulse-chase experiments, we have shown that Rh 1 is 
initially synthesized in high-molecular weight (MW) glycosylated 
forms, that are processed down to the mature form (Figure 3A) 
[40]. Therefore, unlike all other known glycoproteins, the N- 
glycan on Rhi is trimmed but not subsequently built upon to form 
larger carbohydrate structures [36] . Because of this highly unique 
characteristic, Rhi offers a powerful model for studying the 
glycosyl hydrolases without their action being obscured by 
subsequent carbohydrate addition. 

To identify genetic loci critical for Rhi deglycosylation during 
biosynthesis, we analyzed flies from the Zuker collection of 



EMS-generated alleles [41] for defects in carbohydrate trimming 
by monitoring the molecular weight (MW) of Rh 1 . In wUd-type 
adult flies, Rhi is detected in a mature 34 kD form (Figure 3B). 
Mutations that result in defective trimming of the oligosaccharide 
chain during Rh I biosynthesis lead to the accumulation of Rh 1 in 
abnormal high MW forms. For example, NinaA is a chaperone 
required for the exit of Rhi from the ER [39,42-44]. Accordingly, 
in ninaA^''^ mutants, Rhi is detected in an immature, 38 kD 
hyperglycosylated form (Figure 3B) [39,45]. 

In our EMS screen, we observed defects in Rhi deglycosylation 
in two mutant alleles and performed deficiency mapping, followed 
by sequence analysis, to identify each mutant locus. We identified 
a point mutation that leads to a G572E substitution in CG4606, 
which encodes a Class II a-mannosidase enzyme from GH Family 
38, termed a-niannosidase-IIb (ot-Man-IIb). We also identified a 
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Figure 2. N-glycan processing enzymes in humans and Drosophila. Human and Drosophila enzymes in numerous glycosyl hydrolase (GH) 
families are involved in the processing and/or catabolism of N-glycans. These proteins are divided into six major groups and are listed in the order in 
which they are thought to function in the cascade (See Figure 4). (1) Glucosidase I and the a-subunit of glucosidase II are from GH Families 63 and 31, 
respectively (tan). The |3-subunit of glucosidase II is not a GH enzyme (tan). (2) The Class I a-mannosidases from GH Family 47 can be classified into 
three functionally distinct subgroups: Subgroup A includes the ER 0(1,2-mannosidases (orange), Subgroup B includes the Golgi ot1,2-mannosidases 
(yellow), and Subgroup C includes the EDEMs (green). (3) GlcNAc-transferase is not a GH enzyme (dark blue). (4) The Class II a-mannosidases from GH 
Family 38 can also be classified into several functionally distinct subgroups: Subgroup A includes the Golgi 0(1,3(1,6)-mannosidases (light blue). 
Subgroup B includes the lysosomal a-mannosidases (purple), and Subgroup C includes an ER/cytosolic a-mannosidase, which is not found in 
Drosopfiila (grey). (5) The p-mannosidases from GH Family 2 (pink) and (6) the hexosaminidases from GH Family 20 (brown) are also listed. Accession 
numbers presented here indicate the protein sequences that were used for all amino acid alignments and sequence analyses performed in this study. 
Man (mannosidase), Hexo (hexosaminidase), L (lysosomal). 
doi:1 0.1 371 /journal.pgen.1 004349.g002 



substitution leading to a pre-mature stop codon (Q592X) in 
CGI 3 18, which encodes a P-N-acetylglucosaminidase enzyme 
from GH Family 20, termed hexosaminidase 1 (Hexol). 

Identification of the n-man-IIb''''^^'^^ and hexol^^^'^ mutants 
served as the starting point for a larger analysis of the role of 
glycosyl hydrolases from these GH families in Rhl deglycosyla- 
tion. Specifically, we focused on Class II a-mannosidases from GH 
Family 38 [46] and hexosaminidases from GH Family 20 [47] 
with potential roles in N-glycosylation. We identified additional 
mutant alleles of a-man-IIh and hexol, as well as mutant alleles 
corresponding to a second Class II ot-mannosidase from GH 
Family 38, ot-mannosidase-II (a-Man-II, CGI 8802), and a second 
hexosaminidase from GH Family 20, termed fused lobes (Fdl, 
CG8824). Here, we characterize the roles of a-Man-II, a-Man- 
Ilb, fused lobes, and Hexol during Rhl deglycosylation in vivo 



(Figure 3B). Again, the ninaA ' mutant is shown for reference, as 
Rhl is retained in the ER in this mutant, and thus N-glycan 
trimming is blocked at a very early stage during biosynthesis. 
Accordingly, mutations in ninaA lead to the accumulation of a 
hyperglycosylated, immature form of Rlil that is significandy 
larger than the mutant forms observed in the glycosyl hydrolase 
mutants, which function for N-glycan trimming downstream in the 
Golgi (Figure 3B). 

To determine the extent of Rhl deglycosylation during its 
biosynthesis, we performed digestions with endoglycosidase H 
(Endo H) and peptide N-glycosidase F (PNGase F) and found that 
the mature form of Rh 1 is completely insensitive to treatment with 
either enzyme (Figiare 3C). These results suggest that mature Rhl 
protein is completely void of carbohydrate. However, it is possible 
that mature Rh 1 is highly trimmed, but retains a small amount of 
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Figure 3. Rhi is deglycosylated in a step-wise fashion. (A) Western blot of Rhi protein from transgenic flies carrying the P[hs:Rhl-bov] 
construct, probed with an antibody (1D4) directed to the bov epitope tag on Rhi. Following a 1 hour heat pulse at 37 C, flies were shifted to 22 C 
and assayed at the indicated times (T, in hours). Lanes 1 and 2 show Rhi -bov under the control of the endogenous Rhi promoter, expressed in either 
ninaA^^^ mutant or wild-type (WT) flies. The epitope tag adds ~2 kD to Rhi, such that the immature forms are detected at 42 and 40 kD and the 
mature form is detected at 36 kD. Five fly heads were loaded per lane, with the exception of lane 2 (WT), in which 1/6 head was loaded. (B) Western 



blot of Rhi protein from wild-type (WT), ninaA^'^^, «/^°^"', a-man-lf^ 



i-man-llb"'"^'', and hexof'""" mutant flies. One half of a fly head was 



loaded per lane, with the exception of lane 2 (ninaA), in which 15 heads were loaded per lane. (C) Western blot of Rhi protein from Canton S. wild- 
type tissue, treated {+) with either Endo H or PNGase F enzyme. (D) Western blot of Rhi protein from a-man-lf'''''"'^'' mutant tissue (Lane 1), treated 
(+) with either Endo H (Lane 2) or PNGase F (Lane 3), and compared alongside a 1-head wild-type sample (Lane 4, WT). 
doi:1 0.1 371/journal.pgen.1 004349.g003 
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carbohydrate that is not resolved by SDS-PAGE. To determine 
whether we are able to detect single oligosaccharide differences via 
Western blot analysis, we took advantage of Rhl's sensitivity to 
both Endo H and PNGase F in the a-man-Il mutants. Endo H 
cleaves between the first and second N-acetyl glucosamine 
(GlcNAc) residues, Nl and N2, leaving Nl attached to the protein 
(Figure 1). In contrast, PNGase F cleaves between Nl and the 
asparagine residue on Rhl, thus removing the final GlcNAc 
residue (Figure 1). Therefore, following PNGase F digestion, no 
carbohydrate remains. We compared the MW of Rhl protein in 
a-man-II mutant tissue that had been treated with each enzyme. 
Here, we show that when o; -man-// mutant fly extracts were treated 
with Endo H, the Rhl protein was slightly larger than when 
treated with PNGase F (Figure 3D). These results demonstrate that 
we can detect the presence or absence of a single GlcNAc residue. 
Further, the MW of mature Rhl protein in wild-type flies is 
indistinguishable from the Rhl protein in the PNGase F-treated 
sample (Figure 3D). These results support the concept that the 
final GlcNAc residue has been removed from mature Rhl and are 
consistent with the hypothesis that Rhl is fuUy deglycosylated 
during biosynthesis. 

The results presented here demonstrate that Rhl deglycosylation 
is a highly complex process involving defined and sequential 
trimming steps, carried out by a cascade of glycosyl hydrolase 
enzymes. In addition to ot-Man-U, a-Man-IIb, fused lobes, and 
Hexol, we have identified numerous glycosyl hydrolases in 
Drosophila with potential functions in N-glycosylation. The Drosophih 
glycosyl hydrolases that we identified displayed a high degree of 
amino acid identity with corresponding enzymes in humans, 
allowing us to place them into well-defined families and subfamilies 
(Figures 2 and S2, S3, S4, S5, S6, S7). Therefore, in addition to our 
characterization of a-Man-II, a-Man-Ub, fused lobes, and Hexol, 
we provide an informatics-based analysis of the fuU complement of 
human and Drosophila glycosyl hydrolases involved in N-glycan 
processing (see Supporting Information and Figure 2). 

Early steps in N-glycan processing 

N-glycan processing begins with the sequential removal of three 
terminal glucose residues from branch A by a-glucosidase I and a- 

glucosidase II (Figures 1 , 2 and 4) [48] . Following glucose removal, a 
family of ER and Golgi al,2-mannosidases collectively work to 
cleave the four al,2-linked mannose residues from the N-glycan 
structure to yield Man5GlcNAc2 (Figures 1 and 4) [49]. These 
enzymes belong to GH Family 47 and are also referred to as Class I 
al,2-mannosidases (Figure 2) [50]. Following trimming to Manj- 
GlcNAc2, the classical pathway for oligosaccharide maturation 
involves the action of GlcNAc-transferase I, which functions to add 
a single |31,2-linked N-acet}'lglucosamine (GlcNAc) to the terminal 
mannose (M4) on branch A (Figures 1, 2 and 4) [51,52]. The 
presence of the terminal GlcNAc residue serves as a signal for 
additional trimming events mediated by a group of Class II Qf- 
mannosidases belonging to GH Family 38 (Figure 2) [35,51,53]. 
Class II a-mannosidases have complementary specificity compared 
to the Class I enzymes from GH Family 47, primarily cleaving 0(1,3- 
and Qtl,6-linked mannose residues from the N-glycan structure [54], 
as opposed to Q(l,2-linked mannose residues (Figure 1). We have 
identified Drosophila homologs for each of the components in this 
pathway (Figures 2 and 4), and provide detailed amino acid (aa) 
alignments and analyses in Supporting Figures S2-S5. 

Drosophila ot-Man-ll functions upstream from a-Man-llb 
for Rhl maturation 

The presence of the terminal GlcNAc residue, added by 
GlcNAc-transferase I, serves as a signal for additional trimming 



events mediated by a group of Class II a-mannosidases belonging 
to GH Family 38 (Figure 4) [35,51,53]. The first trimming events 
mediated by the Class II enzymes involve the sequential removal 
of two mannose residues, M7 and M8 (Figure 1), from the 
GlcNAc iMan5GlcNAc2 structure [35]. Removal of these residues 
represents the "committed" step in N-glycan processing and is 
responsible for the conversion from high-mannose, Endo H- 
sensitive structures, to complex-type mannose structures that are 
insensitive to Endo H [3,55]. This step occurs in the Golgi and is 
mediated by a pair of isozymes from GH Family 38 (Subgroup A) 
termed a-mannosidasc II (a-Man II) and a-mannosidase IIx (ot- 
Man IIx) in mammals [56]. These enzymes are encoded by 
MAN2A1 [57] and MAN2A2 [58] in humans, respectively 
(Figure 2). We have identified two GH Family 38 members in 
Drosophila from Subgroup A, namely a-marmosidase-II (a-Man-II, 
CGI 8802) and a-mannosidase-IIb (a-Man-IIb, CG4606) 
(Figure 2). 

The evolutionary history between the human and Drosophila 
enzymes may yield insight into their potential functions. The 
mammalian enzymes, a-Man-II and a-Man-IIx, arose from a 
fairly recent gene duplication event (Figure S5A) and are isozymes, 
with identical substrate specificities. Therefore, these enzymes 
have yet to undergo functional specialization. They have, 
however, developed tissue-specific expression patterns [57,59]. 
Based on aa identity, Drosophila a-Man-II appears to represent the 
ortholog of these isozymes (Figure S5A). Drosophila a-Man-II 
displays 40% overall aa identity with both human Man II and 
Man IIx, and 56-57% aa identity within the GH Family 38 N- 
terminal catalytic domain (Figure S5B). By comparison, Drosophila 
a-Man-IIb is significantly more divergent (Figure S5A), displaying 
only 33-34% overall aa identity with the human enzymes and only 
32-46% aa identity within the GH Family 38 N-terminal catalytic 
domain (Figure S5B). It is therefore possible that Drosophila a-Man- 
IIb has evolved a specialized function, unique from the other three 
mannosidases. To investigate this possibility and to further 
understand the role of Drosophila a-Man-II and a-Man-IIb in 
Rhl biosynthesis, we assessed defects in Rhl maturation in mutant 
alleles corresponding to these loci. 

Mutations in Drosophila oc-man-II are homozygous lethal. 
Therefore, we employed a variety of techniques to obtain viable 
mutant alleles. First, we generated mosaic clones of a homozygous 
lethal P-element allele, a-man-lf^^"'^* , in the eye, using a FLP/ 
FRT recombination method [60]. Second, we utilized tissue- 
targeted RNA interference (RNAi) to reduce tx-man-ll transcript 
levels, specifically in the eye, using the a-man- ir"-'" UAS-RNAi 
construct and an eye-specific GAL4 driver line [61]. Finally, we 
obtained two hypomorphic P-element alleles, a-man-lf'*^'" and a- 
man-lt"''''^^ , whose insertional sites lie in non-coding regions 
rendering them homozygous viable. All four alleles are depicted in 
Figure 5A. Mutations in a-man-II lead to defects in the maturation 
of Rh 1 protein, as indicated by the accumulation of a high MW 
form of Rhl in all four mutant alleles (Figure 5C, left). Th(; mosaic 
allele (a-man- 

ljLLoio94^ and RNAi allele (a-man-ir ) are strong 
alleles as Rhl is present exclusively in a high MW form, with no 
mature Rhl detected (Figure 5C, left). In contrast, some Rhl 
protein is present in the mature, wild-type form in the 

hvpomorphic P-element mutants (a.-man-lf''*'"" and a-man- 
li""''"'^) (Figure 5C, left). These results indicate that a-Man-II is 
required for proper deglycosylation of Rhl. 

For the a-man-IIb locus, we obtained a single EMS-generated 
allele, a-man-IIb''^^^^ , and four P-element alleles, a-man-WP^^^* , 
a-man-lw , a-man-lib , and a-man-Uh , ail of which 
are depicted in Figure 5B. As with a-man-II, mutations in a-man-IIb 
lead to defects in Rhl maturation, as indicated by the 
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Figure 4. Pathway for Rhi oligosaccharide trimming. Rh1 deglycosylation is a highly coordinated pathway that requires the sequential actions 
of a large number of glycosyl hydrolase (GH) enzymes and families. Here, we illustrate the major steps that are thought to occur during Rhi 
deglycosylation, beginning with the precursor oligosaccharide, GIC3Man9GlcNAc2. The enzymes responsible for equivalent hydrolysis reactions in 
mammals are shown above each arrow. The Drosophila enzymes predicted to perform these steps during Rhi maturation are indicated below each 
arrow. Of particular note, the step mediated by Drosophila fused lobes is unique to insects and plants. Removal of this terminal GlcNAc residue has 
not been shown to occur in mammals. This explains why elongation of human glycoproteins leads to the formation of complex-type 
oligosaccharides, whereas elongation of Drosophila glycoproteins leads to the formation of pauci-mannosidic structures. Importantly, the subsequent 
steps contained within the grey box represent a novel pathway for Rhi maturation. That is, the mammalian enzymes responsible for these cleavage 
events have only been characterized in the context of carbohydrate catabolism in the lysosome, but have no known roles in glycoprotein processing. 
Here, we propose that the corresponding enzymes in Drosophila play a critical role in Rhi deglycosylation during biosynthesis, leading to the 
complete deglycosylation of Rhi. G (glucose), M (mannose), N (p-W-acetylglucosamine, GlcNAc), Man (mannosidase), Hexo (hexosaminidase). 
doi:l 0.1 371/journal.pgen.1 004349.g004 



accumulation of a high MW form of Rh 1 in all five mutant alleles 
(Figure 5C, right). In the strongest alleles, Rhi is present 
exclusively in a high MW form, whereas in others a small amount 
of Rhi is detected in the wUd-type form (Figure 5C, right), 
indicating that these weaker mutations are hypomorphic. Indeed, 
certain alleles are tcmpc-rature-sensitive and these data are sho^\ n 
in Figure S 1 . These results indicate that a-Man-IIb is also required 
for proper deglycosylation of Rhi. 

The phenotypes observed in the strongest alleles of both a-man- 
II and a.-mm-IIb show that Rhi is present exclusively in a high 
MW form, with no mature Rhi detected (Figure 5C). These 
results suggest that, unlike the human a-Man II and a-Man IIx 
is()Z5'mes, Drosophila a-Man-II and a-Man-IIb do not perform 
redundant functions, but rather are uniquely required for distinct 
steps in N-glycan processing during Rhi maturation. The lethality 
of d-man-II mutants also supports this hypothesis. Still additional 
support for their independent roles during Rhi biosynthesis comes 
from a close comparison in the MW of Rhi detected in a-man-II 
versus a-man-IIb mutants. Specifically, the high MW form of Rhi 
in a-man-II mutants is shghtly larger than the high MW form 
observed in ix-man-IIb mutants (Figure 5D) [11], indicating that 
further trimming has occurred in a-man-IIb mutants. These data 
support the hypothesis that Drosophila ot-Man-II functions upstream 
from a-Man-IIb for carbohydrate trimming during Rhi biosyn- 
thesis. 

The hypothesis that a-Man-II functions upstream from a-Man- 
IIb is further supported by the restjlts of Endo H and PNGase F 
digestions on the mutant tissues. In all four alleles of a-man-II and 
all five alleles of a-man-IIb, the high MW form of Rhi was sensitive 
to digestion with PNGase F, demonstrating that the abnormal, 
high MW Rhi protein was indeed hyperglycosylated (Figures 5E 
and 5F). The a-ma«-// mutants were also sensitive to digestion with 
Endo H (Figure 5E), indicating that the M7 and M8 mannose 
residues had not yet been removed from the N-glycan. These 
findings are consistent with the predicted role for a-Man-II in the 
removal of M7 and M8. This is also the function of the 
mammalian isozymes, a-Man II and a-Man IIx [56] . In contrast, 
the a-man-IIb mutants were insensitive to treatment with Endo H 
(Figure 5F), indicating that the M7 and M8 residues had already 
been removed and that Rh 1 had progressed in the Golgi past this 
step. Taken together, these data strongly support the notion that a- 
Man-IIb functions downstream from a-Man-II, and downstream 
from M7 and MB removal, for Rhi deglycosylation during 
biosynthesis. Again, the finding that Rhi is detected exclusively in 
a high MW form in strong alleles of a-man-II and a.-man-IIh 
indicates that, not only do they function at unique steps during 
Rhi deglycosylation, but also that there is no functional 
redundancy between the two Drosophila enzymes. These results 
are consistent with the highly divergent amino acid sequence of 
Drosophila a-Man-IIb, compared with the other members of GH 
Family 38, Subgroup A (Figure S5). Although our results clearly 
indicate a role for a-Man-IIb in Rhi deglycosylation, downstream 



from a-Man-II and removal of M7 and M8, the specific role of this 
enzyme in trimming is not yet known (Figure 5G). These data are 
consistent with a possible role for a-Man-IIb in trimming M5. 

Drosophila fused lobes is essential for Rhi 
deglycosylation 

Following removal of the M7 and M8 residues, the N-glycan 
trimannosyl core is built upon to form a variety of N-glycan 
structures. In mammals, the N-glycan typically retains the terminal 
GlcNAc residue (attached to M4) and is additionally modified with 
fucosyl, galatosyl, and/or sialyl residues to form complex-type N- 
glycan structures in the Golgi (Figure 4) [62,63]. In contrast, insect 
cells hydrolyze this terminal GlcNAc by the action of a P-N- 
acetylhexosaminidase, leading to the generation of paucimanno- 
sidic N-glycan structures (Figure- 4) [2,64]. The hexosaminidase 
responsible for this step in Drosophila, termed fused lobes (CG8824), 
is a member of GH Family 20 [10]. This gene was originally 
named due to the mushroom body Jiised lobes phenotype in the 
corresponding mutant [65]. Interestingly, mammalian P-hexosa- 
mindases from GH Family 20 are generally present as soluble 
proteins in the lysosome and have well-characterized roles in the 
catabolism and recycling of sugar residues. Unique among the P- 
hexosaminidases, Drosophila fused lobes is a membrane-associated 
protein that has been implicated as a processing enzyme 
responsible for removing the terminal GlcNAc residue attached 
to the M4 residue on nascent glycoproteins (Figure 4). This event 
was proposed to be a routine step during glycoprotein maturation 
in the secretory pathway [10]. 

To investigate the role of fused lobes in Rhi deglycosylation, we 
obtained a single P-element dSkA&, fdf^'" , as well as two RNAi 
alleles, /fl'f'"^'^ and /rff^®'', all of which are shown in Figure 6A. In 
all three fused lobes mutants, Rhi was detected in an abnormal high 
MW form (Figure 6B). These results indicate that fused lobes is 
required for the proper maturation of Rhi during biosynthesis. 
The high MW form of Rhi detected in fused lobes mutants was 
sensitive to digestion with PNGase F, indicating that the protein 
was hyperglycosylated (Figure 6C). In contrast, Rhi was 
insensiti\c to treatment with Endo H in the fused lobes mutants, 
indicating that M7 and M8 had been removed from the N-glycan 
in the Golgi (Figure 6C). These results are consistent with the 
predicted role of fused lobes, downstream from a-Man-II (and 
removal of M7 and MS), in the Golgi during N-glycan trimming. 

Surprisingly, the MW of Rhi in fiised lobes mutants was larger 
than the MW observed in a-man-II mutants (Figure 3B). Given 
that fused lobes functions downstream from a-Man-II, these data 
suggest that the N-glycan not only retains its terminal GlcNAc 
residue in the absence of fused lobes, but must also undergo some 
form of elongation. Elongation or elaboration of the N-glycan, 
triggered by the abnormal retention of GlcNAc, would account for 
the larger size of Rhi in fiised lobes mutants. Our data are consistent 
with previous findings that mutations in Drosophila fused lobes lead to 
the abnormal generation of hybrid- and complex- type glycans that 
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Figure 5. Drosophila a-Man-ll functions upstream from a-Man-iib during Rlil deglycosylation. Intron/Exon structures for (A) a- 
mannosidase-ll {a-man-ll, CGI 8802) and (B) oi-mannosidase-llb {a-man-llb, CG4606) are shown to indicate the alleles used in this study: P[EP]a-man- 
lf^^\ PDCPh-man-lf"^^^ Pbac[SAstopDsRed]a-man-ll^^'"°^, P[GD2875]^^", PbaclPB]<x-man-llb'"^" , Pbac[RB]a-man-llb^"^\ PbadWHla-man-llb"^"^ , 
Pbac[WH]a-man-llb'°^"'', and a-man-llb''"^^. Blue = coding sequence, red = additional mRNA, yellow = RNAi target. (C) Western blots of Rhi protein 
from a-man-ll (Left) and a-man-llb (Right) mutant flies. Left Lanes: (1) Wild-type (WT), (2) x-man-lf-"'"^", (3) x-man-ir^^^^, (4) a-man-ll^^', (5) a-man- 
Ild06422^ and (6) WT. Right Lanes: (1) WT, (2) a-man-llb'^^"^, (3) a-man-llb'"^^^", (4) a-man-llb"'^"^ (5) a-man-llb^°"^^ (6) a-man-llb""^^, and (7) WT. One 
head was loaded per lane, with the exception of lane 3 on the left side {a-man-lf^^^^, in which 4 heads were loaded. (D) Western blot, comparing the 
molecular weight of Rhi in the a-man-l^^"^"^ and tx-man-llb"'^^^'' mutants, indicating that Rhi is slightly larger in a-man-ll mutants. One half of a head 
was loaded per lane. (E and F) Western blot of Rhi protein from the a-man-//and a-man-llb alleles described in (C), treated (+) with either Endo H or 
PNGase F enzyme and labeled for Rhi. (G) Proposed role for Drosophila a-Man-ll in trimming the M7 and IVIS mannose residues (blue) during Rhi 
biosynthesis. Our data indicate that Drosophila a-Man-llb functions downstream at a step that is distinct from a-Man-ll during Rhi deglycosylation. 
doi:1 0.1 371/journal.pgen.1 004349.g005 



are more typiccdly seen in vertebrates [10]. Therefore, mutations 
in Jiised lobes may promote the formation of a complex N-glycan 
structure on Rhi. 

Trimming the trimannosyl core during Rhi 
deglycosylation 

Whether the terminal GlcNAc residue is removed (as it is in 
plants and insects) or retained (as it is in mammals), the subsequent 
stages in N-glycosylation almost always involve elongation and 
elaboration of the N-glycan structure by the addition of diverse 
sugar moieties (Figure 4). In stark contrast, the N-glycan on 
Drosophila Rhi is further trimmed during biosynthesis (Figure 3 A) 
[36-39]. Indeed, our data are consistent with the hypothesis that 
the mature form of Rhi is completely void of carbohydrate 
(Figures 3C and 3D). Step-wise reduction of the remaining 
trimannosyl core would theoretically involve removal of the 
residual a-linked mannose residues, M4 and M5, the fS-linked 
mannose, M3, and the two GlcNAc residues at the base of the N- 
glycan structure, Nl and N2 (Figure 1). Glycosyl hydrolase 
enzymes capable of performing these steps have been identified, 
but they are generally present as soluble proteins in the lysosome. 

Specifically, there are two lysosomal a-mannosidases from GH 
Family 38 Subgroup B: the major lysosomal a-mannosidase 
(IVIAN2B1) [66-68] and the minor lysosomal a-mannosidase 
(MAN2B2) (Figure 2) [69-71]. These enzymes are required for 
removal of M4 and M5, respectively, during catabolism of the N- 
glycan (Figure 1). There is also a lysosomal fS-mannosidase from 
GH Family 2 (Figure 2) that is iii\ olved in the removal of the fi- 
Unked M3 residue from N-glycan structures during catabolism 
(Figure 1) [31,72,73]. Finally, at the very base of the N-glycan 
structure lie two GlcNAc residues, Nl and N2 (Figure 1). In 
mammals, the bond between these residues is cleaved by a pair of 
[(-hexosaminidase enzymes from GH Family 20, Hex A and Hex 
B [32,74-76]. Again, this particular reduction occurs in the 
context of N-glycan catabolism and the recycling of sugar residues 
in the lysosome. We have identified Drosophila homologs for each of 
the components in this catabolic pathway (Figures 2 and 4), and 
provide detailed amino acid (aa) alignments and analyses in 
Supporting Figures S5— S7. 

The finding that none of these lysosomal enzymes had predicted 
functions in the biosynthetic processing of glycoproteins presented 
a conundrum for the case of Rhi deglycosylation, given that the 
trimannosyl core structure is removed during Rhi maturation. 
One possible explanation is that the residual trimannosyl core is not 
removed in a step-wise fashion by exo-acting enzymes, but rather 
an endo-acting hydrolase removes the remaining structure from 
Rhi in bulk. If this were the case, the lysosomal enzymes would 
not be required for Rhi deglycosylation. Another possible 
explanation is that some of the enzymes previously characterized 
as 'lysosomal', have in fact evolved functions in glycoprotein 
processing and are involved in the step-wise deglycosylation of 
Rhi. Indeed, non-traditional roles have been assigned to a 



number of enzymes formerly thought to function solely in the 
lysosome. For example, a number of 'lysosomal' enzymes have 
been shown to localize to the sperm plasma membrane and 
facilitate sperm-egg interaction during fertilization [77]. Here, we 
present evidence that one such enzyme in Drosophila, Hexo 1 , plays 
a role in Rhi biosynthesis in the secretor)' pathway. Our results 
lend support to the hypothesis that, indeed, Rhi is deglycosylated, 
residue by residue, until the protein is completely void of 
carbohydrate. 

Drosophila Hexol is uniquely required for Rhi 
deglycosylation 

In humans, there are two loci encoding [3-jV-acetylhexosamini- 

dase enzymes from GH Family 20, termed HEXA and HEXB 
(Figure 2). The Hex A enzyme is a heterodimer (a[5) containing 
one 0!-subunit encoded by the HEXA locus [75] and one P-subunit 
encoded by the HEXB locus [76]. In contrast, the Hex B enzyme 
is a homodimer of P-subunits (PP), and thus only requires the 
HEXB gene [32]. These enzymes have been extensively investi- 
gated, due to their known roles in Sandhoflf and Tay-Sachs 
disease, and function to remove terminal GlcNAc or GalNAc 
residues from a variety of carbohydrate and glycolipid structures. 
We have identified three hexosaminidases belonging to GH 
Family 20 in Drosophila, namely hexosaminidase 1 (Hexol, 
CG1318), hexosaminidase 2 (Hexo2, CG1787), and a third 
hexosaminidase characterized above, fused lobes (Fdl, CG8824) 
(Figure 2). ^'Uthough P-hexosamindases from GH Family 20 are 
generally thought of as lysosomal enzymes, involved in the 
recycUng of sugar residues, numerous non-traditional roles have 
been identified for these 'catabolic' enzymes. Both the human and 
Drosophila hexosaminidases have been identified on the plasma 
membrane of various cell types [47,78] and, of particular interest, 
human Hex B has been shown to localize to the sperm acrosome, 
which is derived from the Golgi apparatus [79]. Given these 
uncharacteristic qualities, and the known role of fused lobes in 
glycoprotein processing, we were interested in the potential role of 
Drosophila Hexol and Hexo2 in cleaving the final GlcNAc residues 
on Rhi in the secretory pathway. 

For the hexol locus, we identified an EMS-generated allele, 
hexol^^^^, as well as a P-element allele, hexoT"'""" , both of which 
are depicted in Figure 7A. Mutations in hexol lead to defects in 
Rhi deglycosylation, as indicated by the accumulation of Rhi in a 
slightly higher MW form in both hexol mutants (Figure 7B). The 
very subtle difference in size between the high MW form of Rhi 
observed in hexol mutants and the wild-type form of Rhi is 
consistent with a defect in removing a GlcNAc residue at the very 
end of the deglycosylation cascade. This high MW form of Rhi is 
sensitive to digestion with PNGase F, but insensitive to digestion 
with Endo H (Figure 7C). These results indicate that the 
hyperglycosylated Rhi protein detected in hexol mutants has 
already been processed by the Class II ot-mannosidases in the 
Golgi. Taken together, these results are consistent with a role for 
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Figure 6. Drosophila Fdl is essential for Rhi deglycosylation. (A) Intron/Exon structure for fused lobes (fdl, CG8824), indicating the alleles used 
in this study: P[GD1956]'"^^', P[GD1956]'"^^^, and P[XP]fdf"'^"' . Blue = coding sequence, red = additional mRNA, yellow = RNAi target. (B) Western blot 
of Rhi protein from fused lobes mutants. Lanes: (1) Wild-type (WT), (2) fdr^'", (3) fdr'^^", (4) fd/"'"^^' and (5) WT. One half of a head was loaded in 
Lanes 1, 2, and 5, whereas 4 heads were loaded in Lanes 3 and 4. (C) Western blot of Rhi protein from the fused lobes alleles described in (B), treated 
(+) with either Endo H or PNGase F enzyme. (D) Proposed role for Drosophila fused lobes in trimming the terminal GlcNAc residues attached to iV14 
during Rhi biosynthesis. 
doi:1 0.1 371 /journal.pgen.1 004349.g006 



Hexo 1 in the removal of the N2 GlcNAc residue from the base of 
the N-glycaii during Rhi biosynthesis (Figure 7D). 



hexo2^"' 



and 



Analysis of two /texo2 P-element alleles, 

QiU)6829 

(data not shown). This does not rule out a potential role for Hexo2 



hexo2^"^'''''^'^ , did not reveal defects in the MW of Rhi protein 



in Rhi deglycosylation, as there could be other enzymes with 
redundant functions. However, the finding that Rhi is present 
exclusively in a high MW form in both the fused lobes (Figure 6B) 
and hexol (Figure 7B) mutants, and that these two MWs are clearly 
distinct from one another (Figure 3B), indicates that both fused 
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Figure 7. Drosophi/a Hexo'\ is essential for RKil deglycosylation. 

(A) Intron/Exon structures for hexosaminidase! {hexol, CG1318), 
indicating the alleles used in this study: Pbac[RB]liexof°'"" and 
hexol'^^^^". Blue = coding sequence, red = additional mRNA. (B) Western 
blot of Rhi protein from tiexol mutant flies. Lanes: (1) WT, (2), 
hexo/°^'^^'°"^'-'^'^''^^\ (3) hexoT""""", and (4) WT. Deficiency 
Df(3L)ED4341 deletes cytological region 63F6-64B9 and thus fails to 
complement the hexol locus, which lies at 64A12. 2 heads were loaded 
per lane. (C) Western blot of Rhi protein from the hexol alleles 
described in (B), treated (+) with either Endo H or PNGase F enzyme. (D) 
Proposed role for Drosophila Hexol in trimming the N2 GlcNAc residue 
from the N-glycan during Rhi biosynthesis. 
doi:1 0.1 371 /journal.pgen.1 004349.g007 



lobes and Hexol are independently required for unique steps in 
Rh 1 deglycosylation. 

Summary: Rhi is deglycosylated in a step-wise fashion 

Here, we demonstrate that a-Man-II, a-Man-IIb, fused lobes, 
and Hexol, all play essential and unique roles during Rlil 
maturation. Although three of these enzymes have known roles in 
N-glycoprotein processing, Hexo 1 has not been previously shown 
to be involved in protein biosynthesis. Our results demonstrate 
that Hexo 1 plays a critical role during Rh 1 deglycosylation in the 
secretory pathway and provide direct evidence that the N-linked 
oligosaccharide on Rh 1 is removed in a step-wise fashion, down to 
the final GlcNAc residue (Figure 4). Our findings, demonstrating 
step-wise trimming of the final trimannosyl core residues during 
Rhi biosynthesis reveal exciting new possibilities for the complex 
functions of glycosyl hydrolases in the secretory pathway. 

Discussion 

It has been known for over 25 years that Rhi is deglycosylated 
during biosynthesis [36], however much of the deglycosylation 
pathway for Rh 1 has remained unknown. In this same timeframe, 
an enormous body of research has led to the identification and 
characterization of a wide variety of glycosyl hydrolase enzymes 
involved in N-glycan processing. These studies have focused 
predominantly on the biochemical characterization of mammalian 
glycosyl hydrolases in cell culture. Drosophila remains a highly 
effective genetic model for the dissection of biological pathways 
and has the potential to yield enormous insight into the functions 
of N-glycans in vivo. To date, a comprehensive informatics 
comparison between human and Drosophila glycosyl hydrolases 
involved in N-glycan processing has not been presented. Here, we 
present a framework for such a comparison between the human 
and Drosophila glycosyl hydrolase families (Figures 2 and S2-S7) 
and describe a role for Drosophila glycosyl hydrolases in the 
sequential trimming of the N-glycan during Rh 1 biosynthesis. We 
characterize mutations in four critical glycosyl hydrolase enzymes, 
ot-Man-II, Qt-Man-IIb, fused lobes, and Hexol, and unveil a highly 
coordinated pathway for Rhi deglycosylation in Drosophila 
photoreceptor cells (Figure 4). 

Whereas a-Man-II, a-Man-IIb, and fused lobes have previously 
been implicated in N-glycoprotein processing, Hexo 1 has not, and 
therefore our fmdings on the role of Hexol in the secretory 
pathway are particularly intriguing. Although hexosaminidases are 
typically thought of as lysosomal enzymes involved in N-glycan 
catabolism, both human and Drosophila hexosaminidases have been 
previously identified outside of the lysosome. For example. Hex A 
has been identified on the plasma membrane of human fibroblasts 
[80] and Hex B has been shown to localize to the sperm acrosome, 
which is derived from the Golgi apparatus [79]. All three Drosophila 
hexosaminidases have been identified on the sperm plasma 
membrane and are thought to mediate egg-sperm binding by 
forming complexes with oligosaccharide substrates present on the 
egg zona peUucida (ZP) [47,78]. Interestingly, it has been 
suggested that the family of P-hexosaminidases may be posttrans- 
lationally and differentially processed in different tissue types, 
resulting in either a membrane-bound form of the enzyme 
(typically on the plasma membrane) or a soluble form that is 
either localized to the cytosol of the lysosome or secreted from the 
cell [47] . Given this apparent flexibility in cellular localization, and 
the non-traditional roles that these 'lysosomal' enzymes have 
already been shown to play, a role for Hexol during Rhi 
maturation in the secretory pathway is not unprecedented. 



PLOS Genetics | www.plosgenetics.org 



12 



IVlay2014 | Volume 10 | Issue 5 | e1004349 



Glycosyl Hydrolases Key for N-Glycan Trimming during Rhi Biosynthesis 



The predicted evolutionary relationship between the human 
and Drosophila genes from GH Family 20 provides further insight 
into the potentially broad functions of hexosaminidases. Based on 
amino acid analysis, the phylogenetic tree shown in Figure S6A 
predicts a single hexosaminidase in the last common ancestor 
between humans and Drosophila. Therefore, the human gene pair 
and the Drosophila gene triplet have evolved separately, through 
independent duplication events. As is always the case, gene 
duplication provides the evolutionary potential for functional 
specialization and it is very likely that the Drosophila loci have 
evolved unique and/ or additional functions compared to the 
human loci. This has already been shown for the fused lobes 
hexosaminidase from this gene family. Finally, as with the EtEXA 
and HEXB loci in humans, the three Drosophila loci from GH 
Family 20 {fdl, hexol, and hexo2) are thought to encode subunits that 
can pair in unique combinations to form dimeric hexosaminidase 
enzymes [78]. For example, two dimeric hexosaminidase enzymes 
have been identified on the sperm plasma membrane in Drosophila, 
including HEXA (which consists of one fused lobes subunit and 
one Hexo2 subunit) and HEXB (which consists of one Hexol 
subunit and one Hexo2 subunit) [47]. Factoring in homo-dimeric 
combinations, the human gene pair can yield only three 
hexosaminidase enzymes. In contrast, the three Drosophilii loci 
can encode a total of six unique enzymes. Again, this allows 
significantly greater freedom for functional specialization of the 
various enzymes and makes a role for Hexol and Fused Lobes in 
glycoprotein biosynthesis entirely feasible. 

Our results, demonstrating that Hexol plays a critical role for 
carbohydrate trimming during Rhl biosynthesis in the secretory 
pathway, reveal intriguing new possibilities for the complex roles 
of glycosyl hydrolases traditionally thought to function as catabohc 
enzymes in the lysosome. For example, it is possible that the two a- 
linked mannose residues, M4 and \15, are remcn-ed by one (or 
more) of the six Drosophila lysosomal ot-mannosidases from GH 
Family 38 Subgroup B (Figures 2 and S4). The pathway for Rhl 
deglycosylation would then require the removal of a |3-linked 
mannose, M3, which points to a role for the Drosophila lysosomal P- 
mannosidase from GH Family 20 (Figures 2 and S6). Given the 
requirement by Rhl for these trimming e\ents, which should 
occur upstream of Hexol, we hypothesize that, like IIcxol, these 
'catabolic' enzymes may play roles as processing enzymes during 
glycoprotein biosynthesis. 

Interestingly, the six lysosomal a-mannosidases in Drosophila 
display significant amino acid identity with the major lysosomal a- 
mannosidase in humans, MAN2B1 (Figures S5A and S5C), having 
relatively no similarity to the minor lysosomal a-mannosidase in 
humans, MAN2B2. Importantly, the major lysosomal a-manno- 
sidase has high specificity for cleavage of M4, whereas the minor 
lysosomal a-mannosidase preferentially cleaves M5 (Figure 1). The 
complementary actions of these two enzymes, along with their 
ubiquitous expression, suggest that they function in tandem for N- 
glycan degradation in mammals. These findings indicate that, if 
they play a role in Rhl deglycosylation, the Drosophila lysosomal a- 
mannosidases likely function to cleave M4. Given the absence of a 
direct Drosophila homolog for the mammalian minor lysosomal a- 
mannosidase (Figure 2), it is difficult to predict which glycosyl 
hydrolase functions to cleave M5. 

Just as there is uncertainty regarding the identity of the enzyme 
responsible for trimming M5 from the trimannosyl core, there is 
also uncertainty regarding the role ot Drosophila a-Man-IIb in the 
pathway for Rhl deglycosylation. In this study, we demonstrate 
that a-Man-II and a-Man-IIb are both required at distinct steps 
during Rhl maturation and that a-Man-IIb functions 
downstream from a-Man-II (Figure 5). It has previously been 



shown that a-Man-II functions for the sequential removal of two 
mannose residues, M7 and M8, during N-glycan processing 
(Figure 5) [11]. In contrast, the role of a-Man-IIb is currently 
unknown. Interestingly, Drosophila a-Man-II, a-Man-IIb, and the 
six lysosomal a-mannosidases are all Class II a-mannosidases from 
GH Family 38 (Figure 2). Accordingly, these enzymes all have the 
capacity to remove al,3- and al,6-hnked mannose residues from 
the N-glycan structure, including M4, M5, M7, and M8 (Figure 1). 
Given the predicted role of the lysosomal a-mannosidases in 
cleaving M4 and the known role of a-Man-II in cleaving M7 and 
M8, one possibilit)' is that a-Man-IIb functions to remove M5. 
While this theoretical pairing addresses two unresolved issues in 
the pathway, there is currently no direct evidence to support a role 
for a-Man-IIb in trimming M5. 

In recent decades, there has been considerable interest in the 
fundamental nature of insect glycoprotein processing pathways 
due to the widespread use of the baculovirus-insect cell expression 
system [81]. This system is used to produce recombinant 
mammalian glycoproteins for a variety of different biomedical 
research applications. The insect system is advantageous, as it 
produces recombinant glycoproteins more readily than mamma- 
lian cell expression systems [82] . A major limitation is that insect- 
cell-produced glycoproteins have significantly diflFerent N-glycan 
structures than those produced by mammahan cells and are rarely 
of the complex-type [81,82]. In part, this is due to the presence of 
the fused lobes P-JV^acetylglucosaminidase, which removes a 
critical GlcNAc residue that serves as a building block for 
complex-type N-glycans (Figure 4). This event was proposed to be 
a routine step during glycoprotein maturation in the secretory 
pathway and, accordingly, some fused lobes expression was 
detected in the Golgi when expressed in yeast [10]. 

A Golgi hexosaminidase with a role in N-glycoprotein 
processing has also been identified in the lepidopteran insect, 
Spodoptera frugiperda (Sf21 and S19 cells) [64,83]. This fused lobes 
ortholog, Sf-FcU, has a slightiy higher pH optimum of 6.0 [64,83] 
compared to Drosophila fused lobes, which has a pH optimum of 
5.5 [10]. However, Drosophila fused lobes has a range of activity 
through pH 6-7, and therefore fused lobes is active in the pH 
range typical of secretory pathway compartments, such as the 
Golgi [84] . When expressed in yeast, fused lobes was also detected 
on the plasma membrane and in multivesicular bodies [10]. 
Likewise, fused lobes was detected on the sperm plasma 
membrane [47]. Interestingly, there is currentiy no report of 
fused lobes expression in lysosomes. 

Due to widespread interest in improving the baculovirus-insect 
cell expression system, identification of the fused lobes locus in 
Drosophila melanogaster [10] and the lepidopteran insect, Spodoptera 
fiugiperda [83] was of great interest to a broad audience. Here, 
using Rhl protein as a substrate, we confirm that Drosophila fused 
lobes is indeed required for glycoprotein biosynthesis in vivo. 
Consistent with a range of activity between pH 6-7 [10], our 
results demonstrate that fused lobes is capable of functioning in the 
secretory pathway during Rhl maturation. These, and future, 
studies on the function of fused lobes will likely assist in the 
development of baculovirus-insect cell expression systems capable 
of producing more mammalian-like recombinant glycoproteins. 
For example, genetic elimination or inactivation of fused lobes in 
these cell expression systems might promote the formation of the 
desired complex carbohydrates on mammalian glycoproteins. 

Here, we demonstrate that Drosophila Rhl serves as an excellent 
reporter protein for studying N-glycosylation. In the present study, 
we have used the molecular weight of Rhl protein, and its 
sensitivity to Endo H and PNGase F, in order to gain insights into 
the roles of the glycosyl hydrolases in the N-glycan trimming 
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cascade. While our data are consistent with the hypothesis that the 
biochemical activities of these enzymes are the same as they are in 
vertebrates, precise identification of the glyco-intermediates wiU 
require mass spectrometry or nuclear magnetic resonance (NMR) 
analysis. When paired with the genetic and biochemical data, 
structural analyses of the oligosaccharides present on the various 
forms of immature Rhl presented here wiU be highly useful for 
characterizing N-glycan structure-function relationships. 

Studies involving mutations in N-glycan processing enzymes in 
intact model organisms provide a rich source of information on the 
physiological functions of these enzymes in vivo. Analysis of our 
collection otDrosophila mutants has uncovered unexpected roles for 
glycosyl hydrolases in Rhl biosynthesis. Furthermore, given the 
many severe diseases that result from defects in N-glycosylation, 
including congenital disorders of glycosylation (CDG), ot-mannos- 
idosis, P-mannosidosis, Tay-Sachs disease, and SandhofF disease, 
animal models will likely yield important insights into disease 
mechanisms. In the eye, glycosylation of rhodopsin has long been 
known to be key for photoreceptor health and function. Mutations 
that prevent the N-glycosylation of Drosophila rhodopsin cause 
photoreceptor defects and retinal degeneration [38,85-87]. In 
humans, mutations at the two sites for N-linked glycosylation have 
been identified in patients with the hereditary retinal degeneration 
disorder, autosomal dominant retinitis pigmentosa (T4K, N15S 
and T17M) [88-90]. More recently, failures in oligosaccharide 
trimming during rhodopsin biosynthesis have also been shown to 
cause photoreceptor defects in Drosophila [1 1] . Consistent with 
these findings, the original mutants identified in this study 
originated from a deep pseudopupU (DPP) screen, indicating that 
they display photoreceptor defects and/or undergo retinal 
degeneration. Therefore, future studies of these glycosyl hydrolases 
win likely provide insights into mechanisms of hereditary retinal 
degeneration disorders, as well as other human diseases. While 
Rhl deglycosylation appears to represent a highly unique 
phenomenon, it remains to be seen whether other glycoproteins 
are similarly processed and how broadly the 'catabolic' glycosyl 
hydrolases influence glycoprotein processing. Our study represents 
a critical step towards a genetic dissection of N-glycan processing 
in Drosophila, and yields important insights into the process of N- 
glycan trimming during Rhl biosynthesis in the eye. 

Materials and Methods 

Drosophila strains 

Drosophila melanogaster stocks were reared on standard media at 
22°C, on a 12:12 light:dark cycle. The wild-type stocks used in this 
study were Canton S. and the parental strain from the EMS- 
mutagenesis, brown;scarlet {hw;st). To isolate the a-man-IIb''^^^^ and 
kexol^^^^ mutants, we screened approximately 12,000 EMS (ethyl 
methyl sulfonate) mutagenized lines from the Zuker collection [41] 
for the presence or absence of the deep pseudopupil (DPP), as 
previously described [91-93]. Nini; hundred DPP-defective stocks 
were identified and then further screened for defects in Rhl 
deglycosylation by Western blot analysis. Deficiency stocks used 
for mapping the EMS-generated mutations were obtained from 
the Bloomington Drosophila Stock Center (Indiana University), as 
were the P[EP]a.-Man-lf*'"" (Stock #30106), PBacfRBJ- 
Hexol'"'""" (Stock #17805), and Pbac[WH]Hexof'^''^'' (Stock 
#18732) P-element alleles. The remaining P-element alleles used 
in this study were obtained from the Exelixis Collection (Harvard 
Medical School): P[XP]a.-Man-Il"'^*^\ Pbac[WHJ(x-Man-IIl/"^^^*, 
Phac[\VH]ai-Man-M''-^'^'^\ Phac[RBJa-Man-IIb''""^\ PhacfPBJa- 
Man-IIb''"'"" , P[XP]fdi:"'"", and P[XP]Hexo/"''''^" . The following 
transgenic Drosophila strains, each containing an inducible 



UAS-RNAi construct directed against either the oc-man-II or Jiised 
lobes transcripts, were obtained from the Vienna Drosophila 
RNAi Center (VDRC): P[GD2S75f''''-''\ P[GD1956J'""^^^, and 
P[GD1956J'""''"'. Eye-specific knockdown of the target mRNA 
transcripts was achieved by performing standard genetic crosses 
between the UAS-RNAi strains and a Gal4 driver line provided by 
Claude Desplan: P[UAS-Dcr2];P[ey-Gal4],P[IGMR-Gd4];+. The 
Gal4 driver line contained both eyekss-Gal4 and IGMR-Gal4 
drivers recombined on the second chromosome. Together, these 
two drivers induced RNAi expression in the whole eye from the 
time the eye is differentiated through adulthood [61]. The RNAi 
driver line also harbored a third construct on the X chromosome, 
UAS-Dicer2, to enhance the efficiency of generating small 
interfering RNA. Finally, mosaic mutants for oc-man-II were 
generated with the P-element insertion line pBac[SAstopDsRed]ix- 
man-lF-"'"^*. Specifically, we used the FLP-FRT, P[GMR-hid] 
method using an eyeless-FLP driver to induce genetic mosaics and 
generate mitotic clones of a single genotype in the eye [60] . Both 
the a-man-//aUele (DGRC Stock #140255) and die ey-FLP driver 
Hne (Bloomington Stock #5253) were generously provided by 
Junhai Han. 

Deficiency mapping and DNA sequencing of EMS alleles 

We performed deficiency mapping to narrow the cytogenetic 
locations of the two EMS-generated mutations to 89A5 [a-man- 
jHG572E^ and 64A12 {hexofi-^^^^) on the third chromosome. To 
identify the mutant loci, we sequenced prioritized candidate genes 
within the corresponding deficiency regions. Genomic DNA was 
isolated from the EMS-mutagenized lines [a.-man-IIb''''''^'^^ and 
hexof'-''^''^) and the wild-type parental line {bw;si) using the DNeasy 
Blood and Tissue Kit, according to the manufacturer's instructions 
(QIAGEN Inc., Valencia, CA). Primer pairs were designed for 
numerous loci in the deficiency regions, based on their GenBank 
sequence accession numbers. PCR-amplified DNA serjuences 
were determined by the DNA Sc-quencing Facility at the 
University of Wisconsin Biotechnology Center and were aligned 
using the GAP v4.7 appUcation (The X Window Systems, XI 1 
1.1.3 -XFree86 4.4.0). In the first mutant, we identified a G to A 
substitution at nucleotide position 1715 in the coding region of 
CG4606. This mutation corresponds to a substitution at amino 
acid (aa) 572, from glycine (G) to glutamate (E) (G572E). CG4606 
encodes a Class II ot-mannosidase enzyme from GH Family 38, 
termed a-mannosidase-IIb (a-Man-IIb). In the second mutant, we 
identified a C to T substitution at nucleotide position 1774 in the 
coding region of CGI 3 18. This mutation generates a premature 
stop codon (Och) at aa 592, in place of a glutamine (Q) residue 
(Qp92X). CGI 3 18 encodes a P-N-acetylglucosaminidase enzyme 
from GH Family 20, termed hexosaminidase 1 (Hexol). Identi- 
fication of the a.-man-IIb'^^^^^ and faoi*^*^^ mutants served as the 
springboard for a larger analysis of the enzymes involved in N- 
glycosylation. 

Western blotting 

FUes were reared at 22°C and then aged for 1 week at 29°C 
before head tissue was collected for Western blot analysis. We 
loaded enough tissue in each case to determine the MW of Rhl. 

Proteins were separated by electrophoresis in SDS polyacrydamide 
gels and electroblotted onto nitrocellulose membranes as previ- 
ously described [39]. Rhl protein was detected with a monoclonal 
antibody (4C5) obtained from the Developmental Studies 
Hybridoma Bank (University of Iowa) [36]. Rhl-bov protein 
(Figure 3 A) was detected with a monoclonal antibody (1D4) 
directed to the bov epitope, obtained from Phyllis Robinson. 
The immunoreactive proteins were visuahzed using horseradish 
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peroxidase (HRP)-conjugated goat anti-mouse (Invitrogen Corpo- 
ration, Carlsbad, CA) followed by ECL detection (Amersham 
Pharmacia Biotech, Piscataway, NJ). 

Biochemical procedures 

All reagents for thi; Endo H and PNGase F digestions were- 
obtained from New England BioLabs (Ipswich, MA). Between 20- 
40 fly heads were homogenized into denaturing buffer, sonicated, 
and processed according to a modification of the manufacturer's 
instructions. Specifically, we increased the denaturing buffer to a 
final concentration of 1% SDS and incubated for 4 hours at 22°C. 
AU samples were mixed with appropriate volumes of sample buffer 
[94] and assessed via Western blot analysis. 

Heat pulse-chase experiments 

For heat pulse-chase experiments, we used transgenic flies 
expressing wild-type Drosophila Rhl [ninaE) tagged with a 12 amino 
acid epitope from the C-terminus of bovine rhodopsin (Rhl-bov) 
[40]. The epitope tag adds ~2 kD to Rhl, such that the immature 
forms are detected at 42 and 40 kD, and the mature form is 
detected at 36 kD. The tag does not affect Rhl maturation or 
function [40]. We used transgenic flies expressing Rhl-bov under 
the control of either the Drosophila hsp70 heat-shock promoter, 
P[hs:Rhl-bov] [40] or under the control of the endogenous Rhl 
promoter, P[ninaE:Rhl-bov] [40]. Wild-type flies expressing the 
P[hs:Rhl-bojj] construct were exposed to a 1 hour heat shock at 
37°C and then shifted back to room temperature and assayed at 
the indicated times, in hours. Wild-tyjoe and nmaA^^''' mutant flies 
expressing the P[ninaE:Rhl-bov] construct were shown to indicate 
the mature and immature forms of Rhl under non-heat shock 
conditions. 

Supporting Information 

Figure SI Age and temperature data. Western blots of Rhl 
protein from flies aged for either 1-day or 10-days at room 
temperature (22°C) (Lanes 2 and 3) and 1-day or 10-days at 29°C 
(Lanes 4 and 5). In all cases, lane 1 is a wild-type (WT) control 
sample aged for 1-day at room temperature (22°C). AU Rhl blots 
(top) were re-probed with a monoclonal antibody directed to P- 
tubulin (P-tub) as a loading control (bottom), as indicated in A. (A) 
Wild-t\'pe (Canton S.), (B) a-man-Il mutant alleles from top to 
bottom: a-man-U , a.-man-ii , a-man-U , and a-man- 

II '", (C) c/.-man-IIb mutant alleles from top to bottom: cc-man- 
lib"''''', a-man-IIl/"'''\ a-man-IIlf"''' , a-man-IIb"" , and 
a.-man-IIb'"^^'' , (D) fused lobes mutant alleles from top to bottom: 
fdl"'^'",fdr*'''",fdf'*^'-^'', and (E) hexol mutant afleles from top to 
bottom: hexol^'''^'^ and hexol'"'""^' . 
(PDF) 

Figure S2 Glucosidase I and II. Full-length amino acid (aa) 
alignments between the evolutionarily related human (h) and 
Drosophila (d) glucosidase proteins, generated with the UniProt 
Align program using the GenBank sequence accession numbers 
listed in Figure 2. Identical amino acids are marked with asterisks 
(*), strongly similar amino acids are marked with two dots (:), and 
weakly similar amino acids are marked with one dot (.). 
Underlined regions represent predictc'd transm(;mbranc domains 
(TMHMM Server v.2.0). The glucosidase- I enzyme is encoded by 
a single locus in humans, GCSl. In contrast, human glucosidase II 
consists of both an a- and (3-subunit, encoded by GANAB and 
PRKCSH, respectively. In Drosophila, we have identified CGI 597, 
CG14476, and CG6453 as the homologs for each of the 
glucosidase components characterized in humans, respectively 
(Figure 2). These Drosophila genes have not been previously 



characterized or annotated (FhBasc: [SI]) and here, we show that 
they display clear homology to the known enzymes in humans. (A) 
Human glucosidase I (GCSl) and the Drosophila homolog 
(CGI 597) share 41% overall aa identity and 43% aa identity 
within the GH Family 63 domain {Drosophila aa62-849). (B) The a- 
subunit of human glucosidase II (GANAB) and the Drosophila 
homolog (CG14476) share 47% overall aa identity, 53% identity 
within the GH Family 31 carbohydrate transport domain 
{Drosophila aal08-891), 58% aa identity within the GH Family 
31 general domain {Drosophila aa362-699), and 62% identity 
within the galactose mutarotase-like domain {Drosophila aa236- 
304). Yellow shading indicates the putative catalytic sites and 
purple shading indicates additional putative active sites [S2-4]. (C) 
The P-subunit of human glucosidase II (PRKCSH) and the 
Drosophila homolog (CG6453) share 39% overall aa identity, 49% 
aa identity within the PRKCSH-like domain {Drosophila aa26- 
159), and 64% aa identity within the Low Density Lipoprotein 
Receptor Class A domain {Drosophila aa85-120). Blue shading 
indicates putative calcium-binding sites [S5] and purple shading 
indicates putative surface binding sites [86] . Both proteins contain 
a C -terminal HDEL endoplasmic reticulum retention signal (bold) 
[S7]. 
(PDF) 

Figure S3 Class I al,2-mannosidases (GH Family 47). There are 
seven members of GH Family 47 in humans, divided into three 
major subcategories based on their cellular localization and 
biochemical properties, including the ER subfamily (Subgroup 
A, orange), the Golgi subfamily (Subgroup B, yellow), and the 
Edem subfamily (Subgroup C, green). We have identified five GH 
Family 47 mannosidases in Drosophila. Amino acid sequence 
analysis revealed significant homology' between specific human 
and Drosophila enzymes in GH Family 47, allowing us to assign the 
Drosophila proteins to specific subfamilies, as outlined below and in 
Figure 2. (A) Phylogenetic tree depicting the predicted evolution- 
ary relationships between the Class I a-mannosidases from GH 
Family 47 in humans (h) and Drosophila (d) generated with the 
UniProt Align program using the GenBank sequence accession 
numbers listed in Figure 2. Black arrows designate speciation of 
the last common ancestor between humans and flies, leading to the 
production of orthologs. White arrows denote presumed gene 
duplication events, leading to the production of paralogs. 
According to these data, multiple gene duplication events within 
the Golgi subfamily (Subgroup B, yellow) likely occurred late in 
evolution, independently giving rise to both the mammalian gene 
family [S8] and the Drosophila gene pair [S9] . Therefore, in the last 
common ancestor between Lhosophila and humans (see dotted fine), 
there was likely only a single Golgi al,2-mannosidase gene 
(yellow). This hypothesis is further supported for the human loci by 
the similar positioning of intron/exon boundaries [SIO]. The 
particularly high overall aa identity (60%) and overlap in 
enzymatic specificity between human Golgi MAN lA and Golgi 
MAN IB suggests that they represent one of the most recent gene 
duplications in the a-mannosidase family [S8] . As with the human 
loci, it has been proposed that a duplication event, occurring at 
some point during evolution of the Drosophila lineage, gave rise to 
CG31202 from its parental locus, a-Man-I (CG42275) [S9]. 
Therefore, although the human and Drosophila Golgi loci c'\ ()lv(;d 
from a common ancestral locus, unique duplication events in each 
lineage gave rise to the multiple forms that exist today and detailed 
biochemical confirmation of the precise functions of the Drosophila 
enzymes is lacking. In contrast to Subgroup B, the phylogeny 
suggests that three EDEM (ER degradation enhancing a- 
mannosidase-like) proteins were present in the last common 
ancestor between Drosophila and humans (Subgroup C, green). 
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However, the most extensively studied EDEM in humans, 
EDEMl, does not appear to have a direct counterpart in 
Drosophila. This would suggest that the ortholog of mammalian 
EDEMl was lost throughout the course of evolution in the 
Drosophila lineage. (B) Partial amino acid (aa) alignment between 
the human (h) and Drosophila (d) Class I a-mannosidases, generated 
with the UniProt Align program using the GenBank sequence 
accession numbers listed in Figure 2. Numbers refer to aa residues. 
Highlighted in red is a critical arginine (R) residue, conserved only 
among members of Subgroup A (orange). This R residue is a 
hallmark of ERManI enzymes and is responsible for the high level 
of specificity in cleaving MIO on branch B (Figure 1). Among the 
five GH Family 47 members identified in Drosophila, a-Man-Ib is 
the only mannosidase that harbors this R residue. AH other GH 
Family 47 members contain either a leucine (Subgroup B, yellow) 
or a glycine (Subgroup C, green) at this position. (C) Full-length aa 
alignment between human (h), Drosophila (d), and yeast (y) GH 
Family 47 members from Subgroup A, generated with the UniProt 
Align program using the GenBank sequence accession numbers 
listed in Figure 2. Identical amino acids are marked with asterisks 
(*), strongly similar amino acids are marked with two dots (:), and 
weakly similar amino acids are marked with one dot (.). 
Underlined regions represent predicted transmembrane domains 
(TMHMM Server v.2.0).The first trimming event mediated by 
GH Family 47 involves the removal of a terminal mannose residue 
(MIO) on branch B (Figure 1), to generate Man8GlcNAc2 [49,50]. 
The enzyme responsible for this step is an ER resident Q(l,2- 
mannosidase, also known as ER a-mannosidase I (ERManI). 
ERManI is the sole member of Subgroup A within GH Family 47 
and has been well characterized in S. cerevisiae (MNSl, 
NP_012665.1) [Sll] and in humans (MANIBI) [S12,S13]. 
Drosophila a-Man-Ib displays 4 1 % overall aa identity with human 
ERManI and 54% aa identity within the GH Family 47 domain 
{Drosophila aa242-680). For a description of the aa shading, see 
below. (D) Full-length aa alignment between the human (li) and 
Drosophila (d) GH Family 47 members from Subgroup B, generated 
with the UniProt Align program using the GenBank sequence 
accession numbers listed in Figure 2. Identical amino acids are 
marked with asterisks (*), strongly similar amino acids are marked 
with two dots (:), and weakly similar amino acids are marked with 
one dot (.). Underlined regions represent predicted transmem- 
brane domains (TMHMM Server v.2.0). These Golgi al,2- 
mannosidases have the capacity to cleave all four Q(l,2-linked 
mannose residues (M6, M9, MIO, and Ml!) on the N-glycan 
structure (Figure 1) [49]. The mammalian enzymes, referred to as 
Golgi Man LA, IB, and IC, are encoded by MANlAl [S14,S15], 
MAN1A2 [S16,S17], andMANlCl [SIO] in humans, respectively 
(Figure 2). Although these enzymes preferentially cleave specific 
al,2-linked mannose residues from the N-glycan, their functions 
are largely (n'erlapping [49]. In general, the Golgi al,2- 
mannosidases have high specificity for M6, M9, and Mil, located 
on branches A and C of the N-glycan, with significantly lower 
specificity for MIO, located on branch B (Figure 1). In this way, 
their function is complementary to the role of ERManI, which 
primarily cleaves the central mannose residue (MIO) and has very 
low specificity for al,2-linked residues on branches A and C 
[49,S18]. We have identified two GH Family 47 mannosidases 
from Subgroup B in Drosophila, namely a-mannosidase-I (a-Man-I, 
CG42275) and an unannotated locus, CG31202 (Figure 2). The a- 
Man-I enzyme displays between 44-^8% overall aa identity with 
human Golgi Man lA, IB, and IC and between 56-60% identity 
with these enzymes in the GH Family 47 domain (Drosophila 
aal98-647). In contrast, CG31202 display only 37-39% overall aa 
identity with the human enzymes and only 41-43% identity within 



the GH Family 47 domain {Drosophila aa78-512). For a description 
of the aa shading, see below. (E/F) Full-length aa alignments 
between GH Family 47 members from Subgroup C, including (E) 
human EDEM2 ahgned with Drosophila Edeml (CG3810) and (F) 
human EDEM3 aligned with Drosophila Edem2 (CG5682), 
generated with the UniProt Align program using the GenBank 
sequence accession numbers listed in Figure 2. Identical amino 
acids are marked with asterisks (*), strongly similar amino acids are 
marked with two dots (:), and weakly similar amino acids are 
marked with one dot (.). Underlined regions represent predicted 
transmembrane domains (TMHMM Server v.2.0). Class I a- 
mannosidases from Subgroup C represent the most recent 
additions to GH Family 47, and comprise a group of ER-resident 
proteins [SI 9]. In contrast to the other Class I a-mannosidase 
enzymes,that function in the normal biosynthetic trimming of 
newly synthesized glycoproteins, members of Subgroup C function 
in quality control by targeting misfolded glycoproteins for ER- 
associated degradation (ERAD) [S20]. Three members of this 
group, EDEMl [S21], EDEM2 [S22], and EDEM3 [S23] have 
been identified in mammals (Figure 2). In contrast, there are only 
two members of Subgroup C in Drosophila, Edeml (CG3810) and 
Edem2 (CG5682) (Figure 2). Drosophila Edeml displays 54% 
overall aa identity with human EDEM2 and 64% aa identity 
within the GH Family 47 domain {Drosophila aa48^88). Drosophila 
Edem2 displays 48% overall aa identity with human EDEM3 and 
62% aa identity within the GH Family 47 domain {Drosophila 
aa57-494). Human EDEMl is considerably divergent from the 
()tli(;r members of this family and thus has been omitt(;d from 
either alignment. For a description of the aa shading, see below. 
Description of amino acid shading in C-F: Class I C(l,2- 
mannosidases have been extensively studied with regard to 
enzyme kinetics, substrate specificity, structure, and catalytic 
mechanism [50]. Crystal structures for several members of GH 
Family 47 have been solved, including yeast MNSl [S24], human 
ERManI [S25], murine MANlAl [S26], an Oll,2-mannosidase 
from P. citrinum [S27], and an al,2-mannosidase from T. reesei 
[S28]. These structures, in combination with extensive site- 
directed mutagenesis [S29-33] and computational modeling 
[S34], reveal critical insights into the aa residues that are essential 
for the enzymatic activity of GH Family 47 enzymes. Highly 
conserved amino acids that define this evolutionarUy related group 
of enzymes are highlighted, as described below. Green shading 
indicates two cysteine residues (lacking in Edems) that form a 
disulfide bond that is essential for stabilizing the tertiary structure 
of the molecule. Blue shading denotes residues that are either 
directly or indirectly required for Ca^^ binding. Yellow shading 
highlights the catalytic acid (proton donor) and catalytic base that 
are directiy involved in hydrolysis of the glycoside bond. Purple 
shading denotes a host of additional amino acids that play critical 
roles in the energetics of substrate binding. Finally, as explained in 
Figure S3B above, red shading highlights a critical amino acid 
position that houses either an arginine (R) residue in members 
from Subgroup A, a leucine (L) residue in members from 
Subgroup B, or a glycine (G) residue in members from Subgroup 
C. This distinction significantiy alters the specificity of N-glycan 
processing. Mutating this R to L, in both yeast MNSl [S33] and 
human ERManI [S29], dramatically broadens substrate specific- 
ity, allowing the enzymes to cleave additional al,2-linkages and 
leading to the production of atypical Man84^GlcNAc2 structures. 
Furthermore, crystallization of yeast MNSl has confirmed the 
importance of this residue by revealing that it makes direct contact 
with four unique residues on the N-glycan and serves to stabilize 
the central arm of the N-glycan during cleavage of MIO [S24]. In 
contrast, crystallization of a broad specificity al,2-mannosidase 
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from P. citrinum has indicated that when a smaller residue (G in this 
case) is present at this position, the active site is enlarged, allowing 
for a greater degree of flexibility in the binding and trimming of 
additional mannose residues on the A and C branches of the N- 
glycan (Figure 1) [S27]. 
(PDF) 

Figure S4 GIcNAc-transferase I. Full-length amino acid (aa) 
alignment between human (h) and Drosophila (d) GlcNAc- 
transferase I, generated with the UniProt Align program using 
the GenBank sequence accession numbers listed in Figure 2. 
Identical amino acids are marked with asterisks (*), strongly similar 
amino acids are marked with two dots (:), and weakly similar 
amino acids are marked with one dot (.). Underlined regions 
represent predicted transmembrane domains (TMHMM Server 
v.2.0). Following trimming to MansGlcNAcg, the classical 
pathway for oligosaccharide maturation involves the action of 
GlcNAc-transferase I, which functions in the addition of a single 
P 1 ,2-linked N-acetylglucosamine (GlcNAc) to a terminal mannose 
(MA) on branch A (Figure 4) [51,52]. This modification occurs in 
the Golgi and is a prerequisite for the formation of complex N- 
glycans. The enzyme responsible for this function (also called 
Mgatl) is highly conserved between Drosophila (CGI 3431) and 
humans (MGATl). Drosophila Mgatl displays 51% overall aa 
identity with human MGATl and 62% aa identity within the 
GT13 GlcNAc-TI domain {Drosophila aal 19-449). Purple shading 
indicates putative substrate binding sites [S35]. 
(PDF) 

Figure S5 Class II al,2-mannosidases (GH Family 38). There 
are five GH Family 38 members in humans, divided into three 
major subcategories based on their ceUular localization and 
biochemical properties, including the Golgi subfamily (Subgroup 
A, blue), the lysosomal subfamily (Subgroup B, purple) and the 
ERManll subfamily (Subgroup C, grey). We have identified eight 
GH Family 38 mannosidases in Drosophila. Amino acid sequence 
analysis reveals significant homology between specific human and 
Drosophila enzymes in GH Family 38, allowing us to assign the 
Drosophila proteins to specific subfamihes, as oudined below and in 
Figure 2. (A) Phylogenetic tree depicting the predicted evolution- 
ary relationships between the Class II a-mannosidascs from GH 
Family 38 in humans (h) and Drosophila (d), generated with the 
UniProt Align program using the GenBank sequence accession 
numbers listed in Figure 2. Black arrows designate speciation of 
the last common ancestor between humans and flies, leading to the 
production of orthologs. White arrows denote presumed gene 
duplication events, leading to the production of paralogs. The 
Drosophila lysosomal a-mannosidase loci (purple) have undoubtc-dly 
resulted from more recent duplication events and, accordingly, 
display between 65-77% overall aa identity with one another. In a 
recent study, these lysosomal a-mannosidases were designated 
LManI (CG5322), LManll (CG6206), LManlll (CG9463), 
LManlV (CG9465), LManV (CG9466), and LManVI (CG9468) 
[46]. Further evidence for their recent duplication is the 
positioning of the corresponding loci within the genome. The loci 
encoding LManI and II are organized back-to-back at 31E5, 
whereas the loci encoding LManlll-VI are organized in tandem at 
29F1. (B) Full-length amino acid (aa) alignment between GH 
Family 38 members from Subgroup A, including human (h) a- 
Man II (MAN2A1), human a-Man IIx (MAN2A2), Drosophila (d) 
a-Man-II (CGI 8802), and Drosophila a-Man-IIb (CG4606), 
generated with the UniProt Align program using the GenBank 
sequence accession numbers listed in Figure 2. Identical amino 
acids are marked with asterisks (*), strongly similar amino acids are 
marked with two dots (:), and weakly similar amino acids are 



marked with one dot (.). Underlined regions represent predicted 
transmembrane domains (TMHMM Server v.2.0). Drosophila a- 
Man-II displays 40% overall aa identity with both human a-Man 
II and a-Man IIx, 56-57% aa identity within the GH Family 38 
N-terminal catalytic domain [Drosophila aal43-482), and 55-56% 
aa identity within the GH Family 38 Middle domain (Drosophila 
aa477-553). By comparison, Drosophila a-Man-IIb is significantly 
more divergent, displaying only 33-34% overall aa identity with 
the human enzymes, 32-46% aa identity ^\ ithin the GH Family 38 
N-terminal catalytic domain [Drosophila aal67-514), and 51-52'/o 
aa identity within the GH Family 38 Middle domain [Drosophila 
aa508-596). For a description of the aa shading, see below. (C) 
FuU-length amino acid (aa) alignment between human (h) and 
Drosophila (d) GH Family 38 members from Subgroup B, generated 
with the UniProt Align program using the GenBank sequence 
accession numbers listed in Figure 2. Identical amino acids are 
marked with asterisks (*), strongly similar amino acids are marked 
with two dots (:), and weakly similar amino acids are marked with 
one dot (.). In humans, there are two lysosomal a-mannosidases 
from GH Family 38 Subgroup B, namely the major lysosomal a- 
mannosidases (Major Lys, MAN2B1) and the minor lysosomal a- 
mannosidases (MAN2B2) (Figure 2). These enzymes are required 
for removal of the M4 and M5 mannose residues from the N- 
glycan (Figure 1), and have been shown to play a vital role in 
maintaining cellular homeostasis [73]. The major lysosomal a- 
mannosidase has high specificity for cleavage of M4, whereas the 
minor lysosomal a-mannosidase preferentially cleaves M5 
(Figure 1). The complementar)' actions of these two enzymes, 
along with their ubiquitous expression, suggest that they function 
in tandem for N-glycan degradation in mammals. Consistent with 
their lysosomal localization, optimum activity for these enzymes 
occurs between pH 4.0^.2 [66,71,S36]. There are six homolo- 
gous loci in Drosophila, CG5322 (LManI), CG6206 (LManll), 
CG9463 (LManlll), CG9465 (LManlV), CG9466 (LManV), and 
CG9468 (LManVI) [46]. All six Drosophila lysosomal mannosidases 
display 39^4% overall aa identity with the major lysosomal a- 
mannosidase in humans (MAN2B1), 45-50% aa identity within 
the GH Family 38 Middle domain [Drosophila CG5322 aa323- 
398), and 55-60% aa identity within the GH Famfly 38 N- 
terminal catalytic domain [Drosophila CG5322 aa2-276). In stark 
contrast, the Drosophila lysosomal a-mannosidases have littie to no 
aa identity with the minor lysosomal a-mannosidase in humans. In 
fact, the minor lysosomal a-mannosidase in humans is so divergent 
from the other members of this family, that it has been omitted 
from the alignment. For a description of the aa shading, see below. 
Description of amino acid shading in B and C: Class II a- 
mannosidases have been extensively studied with regard to 
enzyme kinetics, substrate specificity, structure, and catalytic 
mechanism. X-ray crystallography has been used to determine 
high-resolution structures of both wild-t)'pe and mutant Drosophila 
a-Man-II enzymes, as well as numerous covalent reaction 
intermediates, bound to various inhibitors, synthetic substrates, 
and/or natural oligosaccharide substrates [12-22]. Structural 
analysis of covalent reaction intermediates has also been achieved 
for other members of GH Family 38 by liquid chromatography- 
mass spectrometry (LC-MS), including Jack Bean a-mannosidase 
[S3 7] and a lysosomal a-mannosidase from bovine kidney [S38]. 
These structures, in combination with site-directed mutagenesis, 
provide critical insight into the aa residues that are essential for the 
enzymatic activity of GH Family 38 enzymes. Highly conserved 
amino acids that define this evolutionarily related group of 
enzymes are highlighted, as described below. Yellow shading 
highlights the nucleophile and acid/base catalyst that are directiy 
involved in hydrolysis of the glycoside bond. Dark blue shading 
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indicates residues tliat are required for Zn binding. Purple 
shading denotes a host of additional amino acids that play critical 
roles in the energetics of substrate binding. Some highlighted 
residues only pertain to subgroup A: Pink shading indicates 
residues involved in the holding site, which is only found in 
enzymes that cleave the M7 and M8 mannose residues (Figure- 1). 
Red shading denotes residues involved in the anchor site, which 
is only found in enzymes that require the presence of a terminal 
GlcNAc on branch A in order to cleave M7 and M8. Green 
shading indicates four cysteine residues that form two different 
disulfide bonds, one of which is lacking in Drosophila a-Man-IIb. In 
Figure S5B, light blue shading shows the single residue mutated 
in our Dnmphila a-Man-IIb EMS allele (G572E). In Figure S5C, 
light blue shading denotes missense mutations that cause a- 
mannosidosis in humans [S39]. 
(PDF) 

Figure S6 Hexosaminidases (GH Family 20). (A) Phylogenetic 
tree depicting the predicted evolutionary relationships between the 
hexosaminidases from GH Family 20 in humans (h) and Drosophila 
(d), generated with the UniProt Align program using the GenBank 
sequence accession numbers listed in Figure 2. This list includes 
the human P-hexosaminidase, a-chain (HEXA), the human (3- 
hexosaminidase, P-chain (HEXB), Drosophila hexosaminidase 1 
(Hexol, CG1318), Drosophila hexosamindase 2 (Hexo2, CG1787), 
and Drosophila fused lobes (Fdl, CG8824). Black arrows designate 
speciation of the last common ancestor between humans and flies, 
leading to the production of orthologs. White arrows denote 
presumed gene duplication events, leading to the production of 
paralogs. Based on this analysis, there was likely a single 
hexosaminidase in the last common ancestor between humans 
and Drosophila, thus the human gene pair and the Drosophila gene 
triplet have e\ ()lved separately through independent duplication 
events. (B) Full-length amino acid (aa) alignments between 
evolutionarily related human (h) and Drosophila (d) hexosaminidas- 
es from GH Family 20, generated with the UniProt Align program 
using the GenBank sequence accession numbers listed in Figure 2. 
This list includes the human P-hexosamrnidase, a-chain (HEXA), 
the human P-hexosaminidase, P-chain (HEXB), Drosophila Hexol 
(CG1318), Drosophila Hexo2 (CG1787), and Drosophila fused lobes 
(CG8824). Identical amino acids are marked with asterisks (*), 
strongly similar amino acids are marked with two dots (:), and 
weakly similar amino acids are marked with one dot (.). 
Underlined regions represent predicted transmembrane domains 
(TMHMM Server v.2.0). All pair-wise comparisons between the 
human and Drosophila proteins yield 30-32% overall aa identity 
and 3,5-38 'Xi aa identity within the GH Family 20 catalytic 
domain (d Hexol aa214-557, d Hexo2 aa235-581, and d fused 
lobes aa276-613). Crystallization, photoaffinity labehng, expres- 
sion of chimeric subunits, and site-directed mutagenesis of the two 
human lysosomal P-hexosaminidases reveal a number of aa 
residues that are critical for the enzymatic activity of GH Family 
20 [S40-47]. Highly conserved amino acids that define this 

References 

1. Roth J, Zubcr C, Park S.Jang I, Lrc Y, rt al. (2010) Protein N-glycosylation, 
protrin folding, and protein quality eontrol. Mol Cells 30: 497—506. 

2. Katoh r. l iemever M (2013) t he N's and O's of Drosophila glycoprotein 
glycobiology. GlycoconjJ 30: 57-66. 

3. Komfeld R, Komfeld S (1985) Assembly of asparagine linked oligosaccharides. 
Annu Rev Biochem 54: 631-664. 

4. Silberstein S, Gilmore R (1996) Biochemistry, molecular biology, and genetics of 
die oligosaccharyltransferase. FASEB J 10: 849-858. 

5. Stanley P (2011) Golgi glycosylation. Cold Spring Harb Perspcct Biol 3: pii: 
a005199. doi: 10. 1 101/cshpcrspcct.a005199. 

6. Caramelo JJ, Parodi AJ (2007) How sugars convey information on protein 
conformation in the endoplasmic reticulum. Semin Cell Dev Biol 18: 732—742. 



evolutionarily related group of enzymes are highlighted, as 
described below. Yellow shading highlights the general acid/ 
base catalyst. Green shading indicates cysteine residues required 
for disulfide bond formation. Purple shading denotes a host of 
additional residues that contribute to the enzymatic activity of the 
molecule by hydrogen bonding, substrate binding or stabilization 
of reaction intermediates. Red shading indicates three regions 
found only in the a-chain (HEXA), which are required for its 
specific interaction with and hydrolysis of GM2 (ganglioside). 
Finally, light blue shading highlights a large number of 
missense mutations identified in either human HEXA or human 
HEXB, that are associated with Tay-Sachs disease or Sandhoff 
disease, respectively [32]. 
(PDF) 

Figure S7 P-Mannosidases (GH Family 2). Full-length amino 
acid (aa) alignment between human (h) P-mannosidase (MANBA) 
and the Drosophila (d) homolog, CGI 2.582, from GH Family 2, 
generated with the UniProt Align program using the GenBank 
sequence accession numbers listed in Figure 2. Identical amino 
acids are marked with asterisks (*), strongly similar amino acids are 
marked with two dots (:), and weakly similar amino acids are 
marked with one dot (.). Drosophila CGI 2582 displays 34% overall 
aa identity with human P-Man and 5 1 % aa identit)' within the GH 
Family 2 TIM barrel domain [Drosophila aa343-457). Crystalliza- 
tion and site-directed mutagenesis of a P-mannosidase from 
Bacteroides thetaiotaomicron have revealed a number of critical 
residues that are highly conserved among all members of GH 
Family 2 [S48]. Yellow shading indicates the two c:atalytic 
nucleophiles. Purple shading denotes other critical residues that 
either contribute to the enzyme's catalytic activity or are important 
for substrate recognition and binding. Finally, light blue 
shading highlights four missense mutations that cause P- 
mannosidosis in humans [30]. Importantiy, all of these key 
residues are conserved in the putative Drosophila P-mannosidase 
(CGI 2582). 
(PDF) 
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